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CODING OF MACHINE READABLE CHARACTERS 
( MICR AND OCR ) FOR INFORMATION PROCESSING 

[ ISO Title : Information Processing — Coding of Machine 
Readable Characters ( MICR and OCR ) 3 



National Foreword 

This Indian Standard, which is identical with ISO 2033-1983 ' Information processing — 
Coding of machine readable characters ( MICR and OCR )', issued by the International 
Organization for Standardization (ISO ), was adopted by the Indian Standards Institution on the 
recommendation of the Computers, Business Machine and Calculators Sectional Committee and 
approved by the Electronics and Telecommunication Division Council. 

In this standard, certain terminology and conventions are not identical with those used 
in Indian Standards; attention is specially drawn to the following : 

Wherever the words ' International Standard ' appear, referring to this standard, they 
should be read as ' Indian Standard '. 



Cross Reference 

International Standard 

ISO 646 Information processing — 7-bit coded 
character set for information interchange 



ISO 1004- 1977 Information processing — 
Magnetic ink character recognition — 
Print specifications 



Corresponding Indian Standard 

IS : 10315-1982 7-btt coded character set for 
information interchange 
(Identical) 

IS : 11179-1985 Print specification for magnetic 
ink character recognition for information 
processing 
(Identical) 



The Computers, Business Machines and Calculators Sectional Committee has reviewed 
the provisions of the following ISO Standards and has decided that they are acceptable for use 
in conjunction with this standard : 

ISO 1073 Alphanumeric character sets for optical recognition : 

Part 1 Character set OCR-A — Shapes and dimensions of the printed image 

Part 2 Character set OCR-B — Shapes and dimensions of the printed image 



ISO 2022 Information processing 
extension techniques 



ISO 7-bit and 8-blt coded character sets — Code 
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1 Scope 

This International Standard defines the coded rispresentation of 
printed characters recognized by reading equipment. It in- 
cludes the fonts : 

E13B as covered in ISO 1004 

CMC 7 as covered in ISO 1004 

OCR-A as covered in IS0 1073/ 1 

OCR-B as covered in ISO 1073/? 



2 Field of application 

This International Standard assigns bit-patterns to characters 
recognized by reading equipment. This coded information 
generated by the reading equipment is given to the recipient by 
different media, such as magnetic tape, by data transmission or 
a direct link. This coded representation can also be used by 
printing devices to print the information which shall later be 
read. It is not intended for general information interchange. 

Two different applicatidns are considered : 

— Single-font reader; 

The reading equipment is only capable of recognizing 
one font at a time. 

— Multiple-font reader: 

The reading equipment is capable of recognizing mul- 
tiple fonts at the same time. 



3 References 

ISO 646, information processing — 7-bit coded character set 
for information interchange. 

ISO 1004, Information processing — Magnetic inii character 
recognition — f^int specifications. 

ISO 1073, Aiphanumeric character sets for optical recognition 

— F^rt 1: Character set OCR-A — Shapes and dimensions 
of the printed image. 

— F'art 2: Character set OCR B — Shapes and dimensions 
of the printed image. 

ISO 2022, Information processing — /SO 7-bit and 8-bit coded 
character sets — Code extension techniques. 



4 Coding 

The coding given in this International Standard is based on the 
7-bit code described in ISO 646 and on its extension to 8 bits 
according to ISO 2022. 

The empty positions in code tables 1 to 5 are reserved for future 
standardization. 

This International Standard does not define the character set to 
be read by the reading equipment. 

Two codings are shown. The 8-bit coding is primarily intended 
for use with multi-font readers in the case where the 7-bit 
coding is not sufficient to represent the needed characters. 

Independent of the coding shown in this International Stan 
dard, the code extension techniques given in ISO 2022 are ap- 
plicable, i.e. the 7-bit coding shown in this International Stan 
dard may be transformed into 8-bit coding and the 8-bit coding 
shown in this International Standard may be transformed into 
7-bit coding according to the rules of ISO 2022. Furthermore, 
the characters of columns 10 to IFi can equally be designated as 
a G1 , G2 or G3 set. 

References to positions of tables 1 to 5 in tables 6 to 9 are given 
by the notation "column number/row number". The column 
numbers for 7-bit coding consist of one digit and those for 8-bit 
coding consist of two digits. The notations b^ to bg refer to the 
7 bits or 8 bits of the coding whereby, b^ is the low order bit. 

Example 

Capital letter F is shown in position 4/6 of the 7 bit table and 
04/6 of the 8-bit table. This corresponds to bit pattern 1000110 
and 01000110 respectively. 

4.1 7-bit coding 

The 7-bit coding can be used whenever the number of 
characters shown is sufficient for the application. This coding 
can also be used within an 8- bit environment by adding an 
eighth bit with the value 0, as defined in ISO 2022. 

4.2 8-bit coding 

The B-bit coding can be used whenever the number of 
characters in a 7-bit table is insufficient for the application. The 
8-bit coding can also be used within a 7-bit environment as 
defined in ISO 2022. 
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Table 1 — 8-bit coding of the characters of all fonts 
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NOTE — The empty positions are reserved for future standardization. 
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Table 2 — 7-bit coding of the characters of the CMC 7 font 
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Table 3 — 7-bit coding of the characters of the OCR-A font 
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NOTE — The empty positions are reserved for future standardization. 
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Table 4 — 7-bit coding of the characters of the OCR-B font 
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NOTE — The empty positions are reserved for future standardization. 
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Table 5 — 7-bit coding of the characters of the E 13 B font 
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NOTE -' The empty positiorts are reserved for future standardization. 
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5 General considerations 

5.1 End of line 

If the information read by the equipment is structured in lines 
and if this structure should be maintained two possibilities are 
given : 

— if the information is handled on a basis of records, then 
every line will form one record; 

— if the information is nanaied character- by-character 
(data stream) , the end of a line wilt be coded by means of 
control characters CR and LF. 

5.2 Characters in error 

If a character is recognized but cannot be identified as one 
character of the character set, control character SUB will be 
coded. 



6 Font CMC 7 

The characters of the CMC 7 font will be assigned to the pos- 
itions of the code tables as specified in table 6 (see also tables 
1 and 2). 



Table 6 — Characters of the CMC 7 font and their 
assignment to positions in the code tables 



Table 7 — Characters of the OCR-A font and their 
assignment to positions in the code tables 



Characters of CMC 7 


S-bit 
code 


7- bit 
code 




Digits to 9 


03/0 to 03/9 


3/0 to 3/9 




Capita) letters A to Z 
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7 Font OCR-A 

The characters of the OCR-A font will be assigned to the pos- 
itions of the code tables specified in table 7 (see also tables 
1 and 3). 



7.1 Erase characters 

Erase characters wil! normally be ignored by the reading equip- 
ment. If there is a requirement to code these characters, control 
character DEL shall be coded. For -the character Group Erase 
one or more DEL may be coded 



Characters of OCR-A 



8-bit 
code 



7-blt 
code 



Digits to 9 
Capital tetters A to Z 

■ Full stop (period)* 

T Comma* 

— Equals sign 

+ Plus sign 

^ Hyphen, minus sign* 

/ Solidus 

)|( Asterisk 

ll Abstract symbol HI (hook) 

*Y Abstract symbol H2 (fork) ' 

n Abstract symbol H3 (chair) 

I Abstract symbol H4 

I (long vertical mark) 

H Character erase* 

i^""" Group erase* 

^ Capital letter A 

A Capital letter A 

Jfl Capital letter >C 

^ Capital letter ^ 

}ff Capital letter 

Capital letter 

U Capital letter 

a Colon 

■ 

"^ Semicolon 

r Question mark* 



03/0 to 03/9 
04/1 to 05/10 

02/14 

02/12 

03/13 

02/11 

02/13 

02/15 

02/10 

11/12 

11/13 

11/14 
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07/15 

07/15 
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14/1 

14/8 
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14/10 

14/13 
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03/11 
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3/0 to 3/9 
4/1 to 5/10 

2/14 
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2/13 

2/15 
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5/13 
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7/12 

7/15 

7/15 



3/10 
3/11 
3/15 



See Special description in the following clauses. 



IS : 11203-1985 
ISO 2033-1983 



Table 7 — {concluded) 



Characters of OCR-A 


8-bit 
code 


7-bit 
code 


Ff Quotation mark 


02/2 


2/2 


f Apostrophe * 


02/7 


2/7 


•f Left curly bracket 


07/11 


2/8 


J Right curly bracket 


07/13 
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S Dollar sigr^ 
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Table 8 — Characters of the OCR-B font arid theie 
assignmer>t to positions in the code tables 



See special description in the following clauses. 



7.2 Alternative characters 



If there are alternative representations of one character, all of 
them will be assigned to the same position. This applies to 
characters Full stop. Comma, Hyphen, Question mark and 
Apostrophe. 



7.3 Abstract symbol H2 (fork) 

In general, this symbol is not used in conjunction with the 
alphabetic characters because of its potential interference with 
the letter Y. 

However, when it is possible to recognize both characters cor- 
rectly, the indicated coding shall be used. 



8 Font OCR-B 

The characters of the OCR-B font will be assigned to the pos- 
itions of the code table as specified in table 8 {see also tables 
1 and 4). 



Characters of OCR-B 

(nun^ber according to 

ISO 1073) 



I to 10 digits to 9 

I I to 36 Capital letters A to 2 
37 to 62 Small letters a to z 



8-bit 
code 



7- bit 
oode 



03/0 to 03/9 
04/1 to 06/10 
06/1 to 07/10 



63 ^ Asterisk 02/10 

64 4" Plus sign 02/11 

65 "" Hyphen (minus sign) i 02/13 

i 

66 r; Equals sign \ 03/13 

67 / Solidus 



68 " Full stop Iperiod) 

69 / Comma 



70 



71 



Colon 



Semicolon 



72 II Quotation mark 



Exclamation mark 
Left parenthesis 






73 I Apostrophe 

Discontinuous 

74 ■" underline * 

75 r Question mark 

76 I 

II 

7^ ( 

78 1 Right parenthesis 

79 ^ Less than sign 
flO ^ Greater than sign 

81 I Left square bracket 

82 J Right square bracket 

83 / Percent sign 

84 it Number sign 
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02/14 
02/12 
03/10 
03/11 
02/2 

Gin 
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03/15 
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02/9 
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4/1 to 5/10 
6/1 to 7/10 
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* See special description in the following clauses. 
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Table 8 — {continued) 



Tables ~ (concluded) 



Characters of OCR-B 

(numbers according ta 

ISO 1073) 


8-bit 
code 


7-bit 
code 




Characters of OCR-B 

(numbers according to 

ISO 1073) 


8-bit 
code 


7bit 
code 


85 Of Ampersand 


02/6 


2/6 




106 13 Small letter S 


15/11 


- 


86 q) Commercial at 


04/0 


4/0 




107 •• Diaeresis" 


- 


- 


87 A Upward arrow htead 


05/14 


5/14 




•I 

107a 2 Small letter g 


15/2 


- 


88 XX Currency sign 


02/4 


2/4 




107b Q Small letter 


15/10 


- 


89 ^ Pound sign 


10/3 


- 




107c U Small letter u 


15/13 


- 


90 $ Dollar sign 


10/4 


- 




108 ' Acute accent* 


- 


- 


91 1 Vertical line* 


07/12 


7/12 




109 "* Grave accent* 


- 


- 


92 1 Long vertical line* 


07/12 


7/12 




110 '^ Circumflex accent* 


- 


- 


93 \ Revers solidus 


05/12 


5/12 




111 "' Tilde* 


- 


- 


94 /\ Capital letter A 


14/2 


~ 




112 1 Cedilla* 


- 


- 


95 f^ Capital letter A 


14/0 


_ 




113 -^ Left curly bracket 


07/11 


7/11 


96 ^ Capital letter /E 


14/1 


_ 




114 "V Right curly bracket 


07/13 


7/13 


97 O f'-^P**' l««»^ 


14/10 






Altemettve 
115 III small letter m* 






^m 








06/13 


6/13 


98 Capital tetter 


14/9 


— 




Continuous 






■ m 








116 -— underline* 


— 


- 


99 y Capital tetter 


14/13 


~ 










100 1 J Capha) letter IJ 








117 Space* 


02/0 


2/0 


14/6 


- 




Paragraph 
118 3 (clause sign) 






ffv 








10/7 


- 


101 ^ Capital letter f\ 


14/8 













102 g Small letter 6 








119 "IP Yen sign 


10/5 


_ 


15/0 





















120 m Character erase* 


07/15 


7/15 


103 gg Small letter m 


15/1 






121 m t(n m, Group erase* 


07/15 


7/15 


104 Small letter 

• • 
106 1 J Small letter ij 


15/9 
15/6 














* See special description in the fol 


lowing clauses 
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8.1 Underline characters 

Two characters are provided for underlining 
No. 74 Discontinuous underline 
No. 116 Continuous underline 

The latter, Continuous underline is not intended for use in OCR 
applications. The character Discontinuous underline shall be 
used in OCR applications as a free-standing character only, and 
shall not be printed under another character. It shall be coded 
as indicated. 

8.2 Diacritical marks 

The characters 

No. 107 Diaeresis 

No. 108 Acute accent 

No. 109 Grave accent 

No. 110 Circumflex accent 

No. Ill Tilde 

No. 112 Cedilla 

will be used as a diacritical mark combined with a, letter. In 
general, reading equipment recognizes such a combined 
character as one single character and will code it as one single 
character. If such a combined character is required and it is not 
contained in the code table as a single character, it shall be in- 
cluded in future revisions of this International Standard. 

8.3 Space character 

If the character space (No. 117) is recognized by the reading 
equipment, it may be coded as one or more characters SP. 

8.4 Erase characters 

Erase characters {No. 120 and 121) will normally be ignored by 
the reading equipment. If there is a requirement to code these 
characters, control character DEL is to be coded. For the 
character Group Erase (No. 121) one or more DEI. may be 
coded. 



8.5 Small letter m 

The characters 

No. 49 Small letter m 

No. 115 Alternative smaH letter m 

represent the same character and are coded in the same pos- 
ition in the code table. 



8.6 Vertical line 

The characters 

No. 91 Vertical line 
No. 92 Long vertical line 

represent the same characters and are coded in the same pos- 
ition in the code table. 



9 Font E 13 B 

The characters of the E 13 B font wilt be assigned to the pos- 
itions in the code table as specified in table 9 (see also tables 
1 and 5) 



Table 9 — Characters of the E 13 B font and their 
assignment to positions in the code tables 



Characters of E 13 B 


8-bit 
code 


7-bit 
cods 


Digits to 9 
Symbol 1 ■ ■ 
Symbol 2 |' 
Symbols 41 
Symbol 4 III 


03/0 to 03/9 

13/10 

13/11 

13/12 

13/13 


3/0 to 3/9 

3/10 

3/11 

3/12 

3/13 
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Annex 
Main differences between ISO 2033-1972 and the present edition 

(This annex does not form part of the standard.) 



The main differences between the first and second editions of this International Standard are the following: 

a) the 4-bit coding has been removed ; 

b) the character Vertical line of font OCR-A is now coded in position 7/12 in 7-blt to be in accordance with the IRV of ISO 646; 

c) the characters of font E 13 B have been added; 

d) a coding for multi-font readers has been added; 

e) the alternative assignments of OCR-B characters in positions 2/3 and 2/4 have been removed to be in accordance with the IRV of 
ISO 646. 
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